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In a simulated measurement of the W-boson mass, evaluation of Fisher's information shows the 
optimal jet definition £jj to yield the same precision as the &t algorithm while being much faster at 
large multiplicities. 
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1. Association of hadronic jets observed in high energy 
physics experiments with quarks and gluons in the un- 
derlying collisions of quanta provides an experimental 
handle on fundamental interactions via the so-called jet 
finding algorithms that find a configuration of jets Q, 
represented by the -/Vj Cts 4-momenta pj, for a given event 
P, represented by the iV part light-like 4-momenta p a : 



P = {Pa} 



jet algorithm 



(i) 



Unless jets are energetic and well separated, jet definition 
involves ambiguities that were seen to be a major, even 
dominant source of errors in the planned experiments Q . 

A well-known requirement on possible jet finding al- 
gorithms is the infrared safety |4| or insensitivity of Q 
to collinear fragmentations of particles in P. It is clar- 
ified by a theorem of ref. || expressing fragmentation- 
invariant observables in terms of the energy-momentum 
tensor defined by space-time symmetries uniquely, so 
that such observables can be equivalently represented in 
terms of either hadron, or quark and gluon fields. How- 
ever, the requirement leaves much freedom for the map- 
ping l[T|l. and many jet algorithms emerged over time. 

2. Ref. |4| introduced so-called cone algorithms that 
define a jet as all particles in a cone of a fixed radius 
. Cone axes are usually found iteratively to be directed 
along jets' 3- momenta, and cone overlaps are treated with 
ad hoc prescriptions. The fixed shape of cones enhances 
the stability of cone algorithms and facilitates studies 
of detector corrections, but decreases the jet resolution 
power. 

Ref. introduced a definition based on the shape ob- 
servable thrust ||, as theoretical studies are easier with 
such observables. Here one minimizes the sum 



(2) 



where Tj is the thrust for the j-th jet. (Similar mea- 
sures were considered e.g. in 32] and a special case of jet 
search based on an optimization of a shape observable 
was also employed e.g. in |33j].) However, the required 
minimization was deemed unfeasible 0. 

Successive recombination algorithms emerged with a 
motivation to invert hadronization [lOj. Here one starts 



with a list of particles, computes a "distance" d a b for each 
pair of particles, and replaces the pair with the smallest 
d a b by a single pseudo-particle with p a b = p a + Pb- One 
repeats this until e.g. all d a b exceed a given threshold y C ut 
or only a given number of (pseudo-) particles remain in 
the list. Possible d a b are given by 



d 2 



ab 



E a E b {E a + E b ) n (l-cos0 a6 ) . 



(3) 



where E a and Eb are the particles' energy fractions, 9 a b 
is the angle between them, and y cut is the so-called jet 
resolution parameter, n = and n = 2 correspond to the 
JADE [llj and Geneva |l2j criteria. It was also argued 
that the dynamics of the 2 — > 1 amplitude in QCD is 
matched best by the so-called hj; measure [l3|: 



d 2 

a ab 



min (E 2 ,E 2 ) (l-cos0 a 



(4) 



Such algorithms find jets of irregular shapes. Ref. 0] 
replaced 2 — > 1 recombinations with a global n — > m one 
(but still based on pairwise distances d a b), yielding more 
regular jets but this is more expensive computationally. 

The multitude of available jet algorithms — often dif- 
fering in obscure details — caused their comparative 
studies (e.g. refs. @, [l^, E3, 0)- The subject's im- 
portance has been growing along with the drive towards 
higher precision in the jet physics 0, (lif. 

3. Ref. reinterpreted the physically significant am- 
biguities of jet algorithms due to algorithmic variations 
as instabilities which a correct measurement procedure 
must be free of. The resulting theory , provided a 
context to derive an optimal jet definition from explicit 
physical motivations. The principal points of the theory 
are as follows: 

(i) Calorimetric measurements with hadronic final 
states P must rely on observables / (P) that possess a 
special " calorimetric" , or C-continuity which is a non- 
perturbative generalization of the familiar IR safety (see 
[T7| for details) and which guarantees a stability of / (P) 
against distortions of P such as caused by detectors. Ref. 
[17] pointed out C-continuous analogues for a variety of 
observables usually studied via intermediacy of jet al- 
gorithms. The fundamental role of such observables is 
highlighted by two facts: (1) An observable inspired by 
|17| played an important role in the selection of top quark 
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events in the fully hadronic channel at DO [l^, H3- (2) 
The Jet Energy Flow project [21| provides numerical ev- 
idence that C-continuous observables may indeed help to 
go beyond the intrinsic limitations of conventional pro- 
cedure based on jet algorithms in the quest for the 1% 
precision level in the physics of jets. 

(ii) The proposition that the observed event P inherits 
information (as measured by calorimetric detectors) from 
the underlying quark-and-gluon event q is expressed as 

/ (q) ~ / (P) f° r an y C— continuous /. (5) 

(iii) For each parameter M on which the probabil- 
ity distribution ttm CP) of the observed events P may 
depend, there exists an optimal observable / opt (P) = 
dMlniTM CP) for the best possible measurement of M 
|l8| . This is a reinterpretation of the Rao-Cramer in- 
equality and the maximal likelihood method of mathe- 
matical statistics in terms of the method of moments. 
In the context of multi-hadron final states as "seen" by 
calorimetric detectors, such an observable is automati- 
cally C-continuous. 

(iv) If the dynamics of hadronization is such that eq. 
(0) holds, then good approximations for / opt could ex- 
ist among functions that depend only on Q which is a 
parameterization of P in terms of a few pseudo-particles 
(jets), found from a condition modeled after eq. JSJ): 

/ (Q) ~ / (P) f° r an y C-continuous /. (6) 

This simply translates the meaning of jet finding as 
an inversion of hadronization into the language of C- 
continuous observables. 

(v) C-continuous observables can be approximated by 
sums of products of simplest such observables that are 
linear in particles' energies: 

/(P) =£a3./ (ft,)- (?) 

(The relevant theorems can be found in refs. and[I3.) 

(vi) So it is sufficient to explore the criterion (JSJ) with 
only /'s of the form Q. Then one can perform a Taylor 
expansion in angular variables and obtain a factorized 
bound of the form 

|/(P)-/(Q)|<C /ifl xfi K [P,Q], (8) 

where all the dependence on / is localized within C f t R 
(so the bound remains valid for any C-continuous /) and 

n R [P, Q] = R- 2 Y [P, Q] + E soft [P, Q] , (9) 

where Y [P, Q] = 2 £\ PjQj, E soft \P, Q] = z a E a , and 
R > is a free parameter (see ref. |lj for a discussion) . pj 
are jets' physical 4- momenta expressed as pj = J2 a z ajPa, 
where the so-called recombination matrix z a j is such that 
< z a j < 1 and z a = 1 — J2j z aj > for any a, i.e. a 
part of the particle's energy is allowed to not participate 



in any jet. cjj are light-like 4-vectors related to pj and 
given by qj = (l,Pj - /|pj|) for lepton collisions (qj can be 
definied differently for hadron collisions; see ref. Q for 
details). The recombination matrix z a j occurs naturally 
in the construction of the bound © and is the funda- 
mental unknown in this scheme. Y in © differs from 
in that the jet's physical momentum is used in place of 
the thrust axis. -Esoft is the event's energy fraction that 
does not take part in jet formation, (vii) Since the col- 
lection of values of all / on a given event P is naturally 
interpreted as the event's physical information content, 
the bound © means that the distortion of such content 
in the transition from P to Q can be controlled by a sin- 
gle function; so the loss of physical information in the 
transition is minimized if Q corresponds to the global 
minimum of . The Optimal Jet Definition amounts to 
finding z a j which minimizes 51^, depending on specific 
application, either with a given number of jets or with a 
minimum number of jets while satisfying the restriction 
Q,r [P, Q] < ^cut with some parameter uj cu t > which is 
similar to the jet resolution y cu t of recombination algo- 
rithms. 

4. OJD combines attractive features of the different al- 
gorithms reviewed above and is free of their defects (see 
ref. for more details): (i) OJD is based on a shape 
observable, (ii) It finds jets of rather regular shapes with 
angular radii bounded by R. (iii) it resolves jet overlaps 
dynamically, depending on the global structure of the 
event's energy flow, (iv) uj cu t bounds the soft energy in 
the physically preferred totally inclusive fashion (cf. ref. 
0. (v) OJD is purely analytical, allowing its algorith- 
mic implementations to differ beyond programmatic code 
optimizations and to be customized for specific applica- 
tions, (vi) OJD is embedded in a systematic theory with 
new options for constructing improved data processing 
procedures that go beyond the conventional approach. 

5. Despite the huge dimension of the domain in 
which to search the global minimum, A part x JVj ets — 
0(100-1000), OJD lends itself to efficient algorithmic 
implementations (the Optimal Jet Finder library 22]). 

OJF was first developed in the programming language 
Component Pascal j24j, featuring a unique combination 
of safety and efficiency. This was very useful for the ex- 
perimentation needed to find a satisfactory algorithm. 
Only after that the final port to FORTRAN was per- 
formed. A subsequent testing j2{| and a substantially 
independent verification [2(| revealed no defects of signif- 
icance, indicating a high reliability of the resulting code 

M 

The OJF library can be used to obtain OJD imple- 
mentations adapted for specific applications (see below). 

6. A number of successive recombination algorithms 
were compared in ref. Q in a series of tests none of 
which, however, was conclusive. The JADE algorithm 
proved to be the least satisfactory, the Geneva algorithm 
behaved somewhat erratically, and a group of algorithms 
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(including for and Lucius) exhibited a balanced behavior 
in various tests, typically populating the spread between 
the JADE and Geneva algorithms. Note that the suc- 
cessive recombination scheme is recovered within OJD 
as a heuristic minimum-search trick with n = 1 in eq. 
[l^. which is the geometric mean of the JADE and 
Geneva criteria. Then OJD should roughly fall into the 
same group as the fcx and Lucius algorithms. A conclu- 
sive physically meaningful comparison can be performed 
in the context of the method of optimal observables. We 
explain the procedure using a simple example modeled af- 
ter the measurements of the IF-boson mass M at LEP2 
[2?j |. The details will be published separately 

The process e + e~ — > W + W~ — > hadrons at CM en- 
ergy of 180 GeV was simulated using PYTHIA 6.2 [jflj . 
Each event was resolved into 4 jets. These can be com- 
bined into two pairs (supposedly resulting from decays 
of the W's) in three different ways; we chose the com- 
bination with the smallest difference in invariant masses 
between the two pairs and calculate the average to of 
the two masses. This mapped events to the m axis. We 
used 9 • 10 6 events to generate the probability distribu- 
tion 7Tm(to) and to construct a numerical approxima- 
tion to the optimal observable / op t (to) = 8m In km (to) ■ 
Using this as a generalized moment with a sample of 
iVoxp experimental events would yield an estimate for 
M with the theoretically smallest error estimated as 

6M exp S (iV exp (/opt)) ) where (f% pt ) is sometimes 
identified with Fisher's information. <5M cxp immediately 
reflects suitability of the jet algorithm used. 

We thus compared OJD with the fcx and JADE defini- 
tions. We used the KTCLUS implementation of the for 
algorithm jsoj and modified the recombination criterion 
to obtain the JADE algorithm. All events were forced to 
4 jets, so the parameters y cu t and aj cut played no role. 

For OJD, we chose R = 2 and, for benchmarking pur- 
poses, first employed a primitive variant of OJF-based 
algorithm with a fixed ritries for all events, where ntries 
is the number of independent attempts to descend into 
a global minimum from a random initial configuration. 
The probability to miss the global minimum vanishes for 
larger ritrics! we chose ritries = 10. The obtained / op t (to) 
for the three jet algorithms are shown in fig. ^ 

For A cxp = 1000 (which roughly corresponds to the 
W-mass measurements at LEP2) we found the following: 
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118 



The error of 3 MeV is mostly due to numerical differen- 
tiation in M. 

Note that there are options to improve the measure- 
ment procedure that are specific to OJD, e.g. weighting 
events according to the values of FIr. We have not ex- 
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FIG. 1: Optimal observable / op t (m) for OJF, &t and JADE. 



plored them, as it is sufficient for the purposes of this 
Letter to establish that OJD is at least no worse than 
the for algorithm for this measurement. 

7. An important aspect is the speed of jet algorithms 
at large iV part . This is critical e.g. in the preclustering for 
reducing the number of clusters in each event as seen e.g. 
by the DO detector at FNAL to about 200: otherwise it 
is not possible to analyze data with for as its processing 
time per event is O (N pait ) j^lj. A concern then is how 
the preclustering affects the final results as it has to be 
done using a method unrelated to the &t algorithm, and 
a non-programmatic modification of the latter must be 
treated as a new jet definition (cf. examples in ref. Q). 

Speed of the algorithms as different as OJF and 
KTCLUS (coded in the same dialect of FORTRAN) may 
depend on the computing installation. With this in view, 
we report times per event in units of 10 -2 sec as mea- 
sured on our hardware with our sample of events. 

Npgxt varied from 50 to 170 in our sample, with the 
mean value of 83. The processing time per event is de- 
scribed rather well by the following formulae: 



1.2 x 10" 
1.0 x 10" 



X ^p 3 art 



for KTCLUS, 
2 x iV part x ntrios for OJF. 



(10) 



This behavior was verified for -/V part up to 1700 by split- 
ting each particle into 10 collinear fragments (similarly to 
how a particle may lit up several detector cells). The re- 
quired ntrics only depends on the number of local minima 
of £Ir that reflects the event's global structure (number, 
width of jets, etc.) but not on N pait . 

The simplest OJF-based implementation of OJD with 
a fixed n tr ies for all events is faster than KTCLUS for 
-Apart > 90^/ntrics- Note that the values above 7 for 

retries 

seem to be rarely warranted, and for a substantial frac- 
tion of events very low values are in fact sufficient. We 
have not explored this option, focusing instead on a more 
significant optimization described below. 

8. It is important to appreciate that whereas any mod- 
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ification of the fcx algorithm beyond an equivalent code 
transformation would have to be treated as an entirely 
new jet definition, OJD is formulated without reference 
to any specific implementation, so once a reliable mini- 
mization algorithm is found, it can be used to control the 
quality of other implementations designed for speed. 

Useful modifications result from allowing a misidenti- 
fication of the global minimum for a fraction of events, 
with the quality of the entire data processing procedure 
controlled via Fisher's information (/o pt ). A simple such 
optimization can be implemented entirely using the rou- 
tines from the OJF library; it relies on the well-known 
fact that the jet structure is often determined by the most 
energetic particles: Select the most energetic particles (a 
skeleton event), and precluster them by running the mini- 
mization routine. Then add the remaining particles with 
random values of z a j and run the minimization again. 
With a threshold of 2 GeV to select the energetic parti- 
cles, ntrios = 5 at the preclustering phase and n tr i CS =1 at 
the final stage, only a 1% change was observed for <5M cxp 
(curiously, an improvement) whereas the speed much in- 
creased, with the dependence of the time per event on 
-Apart now given roughly by 

2.5 x 1(T 2 x AT part (11) 

with a hint at a slower growth at large iV part . This is 
faster than KTCLUS starting from iV part ~ 140, and the 
speed advantage increases sharply for higher -/Vp art : for 
IVpart ~ 200 this is twice as fast as KTCLUS, and an 
extrapolation to A part ~ 1000 yields the factor of 50. 

The dramatically better behavior of OJF at large -/Vp ar t 
makes it a candidate for work at the level of detector cells, 
perhaps even on-line (note that all ntries minimization 
attempts can be done in parallel). 

The OJF library implements the first minimization al- 
gorithm found to run acceptably fast. Better algorithms 
may be found once the OJD/OJF is explored further. 

9. To summarize, a conclusive method to compare jet 
algorithms is based on evaluation of Fisher's information. 
In the considered model measurement, OJD is equivalent 
to the fcx definition in physical quality, and an imple- 
mentation of OJD is increasingly faster than KTCLUS 
at large A part starting from A part « 140. Moreover, 
OJD is defined in a theoretically preferred fashion and 
is supported by a systematic theory with new options for 
improvement of jets-based measurements. All this posi- 
tions OJD as a candidate for a standard jet definition for 
the next generation of HEP experiments. 
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